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Abstract 

We determine the thresholds for the number of variables, number of 
clauses, number of clause intersection pairs and the maximum clause de- 
^ ■ gree of a fc-CNF formula that guarantees satisfiability under the assumption 

that every two clauses share at most a variables. More formally, we call these 
formulas a-intersecting and define, for example, a threshold iii{k,a) for the 
. , number of clause intersection pairs i, such that every a-intersecting fc-CNF 

■ formula in which at most ^i{k, a) pairs of clauses share a variable is satisfiable 
[ and there exists an unsatisfiable a-intersecting /c-CNF formula with firn{k, a) 

■ such intersections. We provide a lower bound for these thresholds based on 
CO I the Lovasz Local Lemma and a nearly matching upper bound by construct- 
\Q ■ ing an unsatisfiable fc-CNF to show that tJ,i{k,a) = e(2'''(2+i/Q)) Similar 

I thresholds are determined for the number of variables (/i„ — 6(2'^/")) and 

■ the number of clauses (/i,„ = 0(2'^(^+ = ^)) (see [TU] for an earlier but indepen- 
dent report on this threshold) . Our upper bound construction gives a family 
of unsatisfiable formula that achieve all four thresholds simultaneously. 

1 Introduction 

Satisfiability of CNF is one of the most studied and versatile problems in computer 
science with its own journal (JSAT), competitions and an yearly conference, Inter- 
national Conference on Theory and Applications of Satisfiability Testing (SAT). 
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In this paper we investigate a simple class of criteria that can guarantee satisfia- 
bility of a given /c-CNF formula. We consider threshold criteria, i.e., for several 
quantities connected to a CNF (like the number of clauses, variables or variable 
intersections) we determine a maximum magnitude leading to satisfiable formulas. 
We would like to determine the exact threshold of such quantities, in the sense 
that there exist unsatisfiable formulas for which this quantity is greater than the 
threshold. A tightly determined threshold can be used as a simple satisfiability 
test: given a formula F, determine or count the specific quantities in F and de- 
clare F satisfiable if one of these quantities is below the threshold. Observe that 
such thresholds help in deciding satisfiability only if the considered quantity are 
below the threshold. The problem of deciding satisfiability when all these quan- 
tities are above the threshold is still a hard problem. 

One such threshold that we consider is the number of clauses m. We denote this 
threshold by Umik) and it denotes the smallest number of clauses in an unsatisfi- 
able formula. The trivial lower bound of ^im{k) > 2*^ is easily seen: each formula 
that consists of less than 2^^ clauses is satisfiable since each clause eliminates only 
one out of the 2^^ possible satisfying assignments. On the other hand there is an 
unsatisfiable A;-CNF formula with 2^ clauses, namely the formula consisting of 
all possible 2^ clauses (all positive/negative literal combinations) on k variables. 
Hence, /i„(A:) = e(2'^-)- 

Yet another prominent threshold is the maximum clause degree A of a /c-CNF for- 
mula, i.e. the maximum number of clauses that share at least one variable with 
a fixed clause. The complete formula on k variables once again has maximum 
degree 2^ and gives an easy upper bound for this threshold. On the other hand 
an application of the powerful Lovasz Local Lemma [3j shows that every formula 
with A < 2'^/e is satisfiable leading to the conclusion that /ua = 0(2^). 

In this paper we focus on satisfiability-threshold for a special class of formulas 
which guarantee that two clauses intersect only in a bounded number (henceforth 
we denote this by a) of variables. These formulas are a natural extension of 
linear CNF formulas, i.e., formulas with a = 1, which have been introduced in [9]. 
The naming and concept of linear CNF formula comes from hypergraphs with 
bounded intersections as studied for example in [3]. Intuitively, the restriction 
to bounded intersection makes it harder to build conflicting clauses which lead 
to unsatisfiability. And indeed it was the original goal of the authors to prove 
a higher satisfiability-threshold for A in linear /c-CNF using stronger versions 
of the LLL, e.g., the soft-core LLL version of |13j . While it turned out that the 
satisfiability threshold for A remains 0(2'^) even for linear CNFs we got interesting 
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dependencies on a in the thresholds for other quantities, namely the number of 
variables, the number of clauses and the number of clause intersection pairs. 

2 Related work 

This paper builds highly on the techniques developed by Erdos and Lovasz in the 
classical paper "Problems and results on 3-chromatic hypergraphs and some re- 
lated questions" [3|. Our proofs are built on the powerful Lovasz Local Lemma and 
also make use of and extend the shrinking operation (see Section [5|) that was used 
in [3] to construct interesting linear hypergraphs. Independently but roughly a 
year before the authors conducted this research the paper [10] by Dominik Scheder 
examined the satisfiability threshold for the number of clauses/constraints apply- 
ing essentially the same techniques as here and in While Scheder considers 
multi-value constraint satisfaction problems - essentially a non-binary variant of 
CNF formula — he restricts himself to the threshold [ira- All results presented here 
directly extend to these multi-value CSPs too and to our knowledge this paper is 
the first to states the thresholds for the number of clause intersection pairs, vari- 
ables and the max degree explicitly. More complicated algebraic constructions 
based on ideas of Kuzjurin [6] and Kostochka and Rodl [5] work for the restricted 
case a = 1 and can be found in Lemma 2.2. of [11] without explicit statement of 
thresholds. Most notably, we use the a-shrinking procedure not just in the lower 
bound but apply it to a maximal {k -|- a)-uniform a-intersecting formula in our 
upper bound construction. This is the key to obtaining bounds on the number 
of clause intersections and gives an unsatisfiable a-intersecting formula that is 
extremal (up to log-factors) in all considered quantities. 

Another very interesting related work by Scheder and Zumstein is the paper "How 
many conflicts does it need to be Unsatisfiable" [12] in which upper and lower 
bounds on the threshold for conflicts are given. The notion of a conflict is closely 
related to clause intersections. Instead of counting the pairs of clauses that share 
a variable the number of conflict only counts clause pairs in which at least one 
variable is shared in an opposite literal. The reason why conflicts are interesting 
is because the lopsided version of the Lovasz Local Lemma [4j can be applied to 
A:-CNF formulas in which each clause is involved in at most 2'^/e conflicts and thus 
guarantees their satisfiability. In contrast to the nearly tight threshold [ii{k^ a) = 
Q^2^i'2+i/a)-j £qj, clause intersections in a-intersecting formula established here, 
the conflict threshold is much harder to determine: the best known result for 
a = A: is a;(2.69^) < fXc{k,k) < 0(#^) [12]. 
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3 Preliminaries 



A hypergraph is k-uniform if all edges contain exactly k vertices. Two edges 
are called intersecting if they share at least one vertex and a hypergraph is 
called a-intersecting if any two intersecting edges share at most a vertices. A 
1-intersecting hypergraph is called linear. The edge intersection pairs of a 
hypergraph are all pairs of edges that are intersecting. The degree of a vertex 
is the number of edges it appears in and the degree of an edge is the number 
of edges it intersects with. 

Every fc-CNF formula F induces a A;-uniform (multi)-hypergraph Gp = {y-,E) 
where V is the set of variables and the edge (multi)-set E contains an hyperedge 
over vertices {vi, ■ ■ ■ , tifc} if and only if there exists a clause consisting of the cor- 
responding variables. This gives a one-to-one mapping between clauses and edges 
in the induced hypergrah and we adopt all previously introduced hypergraph ter- 
minology for fc-CNF formula accordingly, e.g., we define clause intersection pairs 
as all pairs of clauses that intersect in at least one variable. 

Throughout this paper we are interested in satisfiability thresholds for a-intersecting 
fc-CNF formula. We consider the following quantities: number of clauses m, 
number of variables n, maximum degree A and number of clause intersection 
pairs i. Denote the thresholds for a quantity q with fiq{a,k). A satisfiability 
threshold fJ-q{a, k) is the smallest number such that there exists an unsatisfiable 
a-intersecting /c-CNF with q = k). Phrased differently it is the largest num- 

ber such that every a-intersecting fc-CNF formula with q < ^q{a, k) is satisfiable. 

Our lower bounds to the thresholds are based on a classical application of the 
Lovasz Local Lemma [3] and its more recent constructive algorithmic versions 
that give randomized [7] and deterministic [U [1] algorithms: 

Theorem 1. Every k-CNF with maximum clause degree A at most — is satisfi- 
able and there is an efficient algorithm to find such an assignment. 

4 Results 

We present lower bounds (Theorem [2]) and nearly matching constructive upper 
bounds (Theorem [3]) that determine all thresholds /ij, /i^, Hn-, fJ-A up to log- factors 
(Theorem S]). Our lower bound in Theorem [2] consists of an algorithm based on 
Theorem [T] that efficiently finds a satisfying assignment for any a-intersecting k- 
CNF formula with few clause intersection pairs, variables or clauses. The upper 
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bound in Theorem [3] proves the existence of unsatisfiable formulas which have 
only slightly more clause intersections, variables and clauses. Note that while 
our proof of Theorem [2] is algorithmic, one needs an efficient implementation of 
Lemma [7] to make Theorem [3] constructive(see also [H]). We suspect that some 
of the bounds below can be improved by 0(fc)-factors but since all bounds are 
exponential in k we did not optimize for these polylogarithmic factors. 



Theorem 2. Every a-intersecting k-CNF with less than 

(2+1/q) 

2a \ ek 

or 



1 / 2('^~°) 

Lj = — 1 clause intersections 



or 



Ln = variables 

\ I 

Lm = T ; — clauses 

k \ ek I 

is satisfiable and a satisfying assignment can be found efficiently. 

Theorem 3. For any k and a < k there is an unsatisfiable a-intersecting k-CNF 
with at most 

Ui = a22(fc+")(2+i/a)^(5+2/a) ^^^^^^ intersections 

and 

Un = 2a2^/°A;2(i+V") variables 

and 
and 

C/a = a2^'^^"'^k^ maximum degree. 



In the following Q{x) means 0(x(logx)^) for some absolute (positive or negative) 
constant c. Combining the above two theorems yields good estimates for the 
thresholds: 

Corollary 4. The thresholds for satisfiability are: 
• number of clause intersections: jii = B(2^(^+-'^/")) 
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• number of variables: Hn = 0(2^^/") 

• number of clauses: Hm = G(2^*^^^«)) 

• maximum degree: ha = 

5 Shrinking and Maximal a-intersecting Hypergraphs 

This section contains useful lemmas about hypergraphs needed to prove the main 
theorems. One operation that will be particularly helpful for both the lower and 
the upper bound is the /^-shrinking operation. The shrinking operation creates 
a fc-uniform hypergraph H' from a (A; + /3)-uniform hypergraph H by deleting the 
/3 vertices of maximum degree from each edge breaking ties arbitrarily. Shrinking 
is similarly defined for (k + /3)-CNF formulas where the variables with highest 
degree are deleted from each clause. The next lemma shows that a high degree 
vertex can survive the ^-shrinking procedure to remain a high degree vertex only 
if many such high degree vertices are present in the original hypergraph. 

Lemma 5. Let H be a [k + a)-uniform a-intersecting hypergraph and H' be the 
result of a-shrinking H. If H' has a vertex of degree d, then H has more than 
ji/a y^rtices of degree at least d. 

Proof. Let v be the vertex in H' of degree d. Since H' was created by shrinking H 
there are at least d edges in H in which v is present but did not get deleted. We 
call the set of those edges £; then we know that \£\ > d. Prom each edge e ^ £, 
exactly a vertices got deleted all of which are of degree of at least d. We claim 
that the mapping that maps each e € 5 to this a-sized set of deleted vertices is 
injective: 

Suppose two edges ei,e2 € £ get mapped to the same a-sized set of vertices. 
Then, the edges ei and 62 intersect in these a vertices; furthermore they also in- 
tersect in the vertex v and thus intersect in a -|- 1 vertices. This is a contradiction 
to the a-intersecting property of H. 

Injectivity gives us that there are \£\ > d different a-sized subsets of vertices 
which got deleted instead of v while shrinking. All vertices in those subsets must 
have degree at least d by definition of the shrinking operation. Furthermore if N 
is the number of distinct vertices in those subsets then we have d< (^) < A^". 
Therefore there are at least N > c/^/" vertices with degree at least d in H. □ 
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The next lemma proves that any maximal a-intersecting hypergraph on n vertices 
must have a large number of edges. It uses a bound on the Turan number that is 
due to de Caen [2]. The Turan number T(n, k, r) for r-uniform hypergraphs with 
n vertices is the smallest number of edges possible such that every set of k vertices 
contains at least one edge. This number was determined for graphs by Turan [Hj 
and extended to hypergraphs by himself in the report "Research Problems" [15]. 

Lemma 6. Every maximal a-intersecting hypergraph H on n vertices has at least 
fn > ^^^a ^dges. 

\a + l) 

Proof. Let H he a maximal a-intersecting hypergraph on m edges. Since H is 
a-intersecting each of the {Jli) subsets of vertices of size a -|- 1 is covered by at 
most one distinct hyperedge oi H. Also, H covers exactly it^{J^i) distinct subsets 
of size a + 1 in H. If < T{n,k,a + 1) the a + 1-uniform hypergraph 

consisting of all covered a + 1-size subsets has less than T{n, A;, a + 1) edges and 
therefore 3 a /c-subset K that does not contain any covered edge. This /c-subset 
can be added as an edges into H while preserving it to be a-intersecting. Indeed, 
if some edge e intersects K in at least a + l vertices, then the corresponding set 
of vertices is covered contradicting the choice of K. Thus if m < "^^"'^'"^"^^ then 

H is not maximal a-intersecting. To finish we use a lower bound of de Caen [2j 
on the Turan number: T{n, k,a + 1) > ""^^^ (a+i) / ('^a^)' Plugging this in gives 
the desired result. □ 

We remark that the same result also appears in Scheder [TOj with somewhat 
simpler and self-contained proof. 

6 A Constructive Lower Bound 

This section gives the proof for the lower bound in Theorem [2 
Proof, (of Theorem [2|) 

We prove that every a-intersecting /c-CNF F is either satisfiable by Theorem [T] af- 
ter a-shrinking it or it must have large clause intersection pairs, variables, clauses 
and a high maximum degree contradicting the hypothesis about the formula F. 

Let F' be the resulting (k — a)-CNF we get from a-shrinking F. If all variables 
in F' have degree less than d = 2^''~°'^ /ek then the Lovasz Local Lemma guaran- 
tees that F' is satisfiable and Theorem [1] states that a satisfying assignment can 
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be efficiently found. Note tliat a satisfying assignment for F' is also a satisfying 
assignment for F. 

In the other case, suppose F' has at least one variable of degree d. Then, Lemma 
shows that F must have at least d^/" variables of degree at least d. 

To count the number of clause intersection pairs in F, we count the intersections 
of clauses containing one of the high degree variables. For each such variable 
the clauses containing it induce a clique with (d — 1)^/2 intersections. Taking the 
disjoint union of these intersections we get at least {d — l)^"'"^/°/2 intersections 
but overcount each intersection up to a-times since two clauses can intersect in 
up to a variables. Therefore F has at least -^{d — 1)^+^/"^ intersections. 

To count the number of clauses in F we look at the union of the clauses containing 
one of the d^/" variables. There are at least d^+^Z" clauses in the non disjoint 
union and each clause can get added because of each of its k variables at most 
once. Thus F has at least d!^^^l°^ jk clauses. 

Finally it is clear that F has at least d^/" variables. □ 

7 Upper bounds for the thresholds 

This section gives the proof for the upper bounds in Theorem El 

Before we prove the theorem itself, the following lemma gives a general way to 
transform a sufficiently dense A;-uniform hypergraph into an unsatisfiable fc-CNF 
formula by iteratively taking a hyperedge and greedily choosing positive or nega- 
tive literals for the variables: 

Lemma 7. // there is a k-uniform hypergraph H on n vertices and at least m = 
n2^ edges than there exists an unsatisfiable k-CNF F inducing H. 

Proof. Denote the vertices in H by vi, . . . , u„ and associate with them, the vari- 
ables xi, • • • , that will occur in F. 

We will denote A € {0, 1}" to be an assignment if we pick an assignment of val- 
ues to variables x by setting Xi = Ai- We say that a clause covers an assignment 
A € {0, l}*^ if it is not satisfied by the assignment. We will iteratively create 
a clause for every edge in H greedily covering the maximum number of yet un- 
covered assignments. We have to show that in the end all 2" assignments are 
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covered. Consequently, the conjunction of the created clauses forms an unsatisfi- 
able k-CNF. 



We pick edges e from H in an arbitrary order. We want to create a clause for 
e on the k variables associated with the k vertices in e. For each variable we 
have the choice to pick the positive or the negative literal. These are 2^^ different 
choices and the assignments covered by two different choices are disjoint. Since 
every assignment can be covered in this way the assignments get partitioned into 
2^ parts. Simple averaging then guarantees that there exists a choice covering at 
least 1/2^ fraction of the assignments not covered so far. After m iterations of 
greedily creating clauses covering the maximal number of uncovered assignments 

is at most 2" (l - 1/2'=)"' = ( (^.^-^^a-fc )" < 1- With all assignments covered 
the created formula F is unsatisfiable and by construction also induces H as 
required. □ 



The above lemma shifts the focus towards finding a suitable dense A;-uniform hy- 
pergraph in order to find an unsatisfying fc-CNF. The following proof of Theorem 
[2] shows that a-shrinking a maximal a-intersecting (k + a)-uniform hypergraph 
results in hypergraphs with nice additional extremal properties. Furthermore 
choosing a large number of vertices results in hypergraphs that obey the bound 
in Lemma [7] and can thus be transformed into the desired unsatisfiable fc-CNF. 

Proof, (of Theorem [3|) 

We create the formula by applying Lemma [7] to an a-intersecting hypergraph. We 
obtain this hypergraph by a-shrinking a maximal a-intersecting (k + a)-uniform 
hypergraph. Observe that it makes the resulting hypergraph fc-uniform. 

We choose n = a (2^~^°'k'^^°'~^^'^^^^°' and build a a-intersecting (k + a)-uniform 
hypergraph on n vertices. The choice of n is such that Lemma [6] guarantees that 
we can find a A; + a-uniform hypergraph H with 



n 



m = = a2('=+")(i+Va)A:2(i+i/-) = 2'=+"n 



edges. This is sufficiently large number of edges to construct an unsatisfiable for- 
mula F for hypergraph H using Lemma [71 Having constructed H, we a-shrink it 
to obtain hypergraph H' and its correspoding formula F' . Note that F' is unsat- 
isfiable because F is unsatisfiable. The significant advantage about H' obtained 
this way is that it has guarantees on the maximum degree and on the number of 
clause intersections. More precisely, we claim that H' has maximum degree less 
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than {■mk)^^^^^^/"\ Suppose that after the shrinking there is a vertex of degree 
d > {m{k + a))-'^/^^^^/") . Lemma [5] shows that in this case H contains at least 
cji/" vertices of degree larger than d. The disjoint union of the edges containing 
those vertices has size at least d^+^Z" and each edge gets counted at most {k + a) 
times this way. Therefore H would have at least d^^^/"/{k + a) > m edges — a 
contradiction. 

Lemma [7] transforms the hypergraph H into an unsatisfiable /c-CNF formula F. 
This formula has n variables and m edges since shrinking preserves these quanti- 
ties. Furthermore, the maximum degree A of F is at most (mk)^^^^^^^"^ which 
also implies that the number of clause intersections is at most mA. □ 
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